arXiv: 1506.0593 lv2 [astro-ph.GA] 5 Aug 2015 


Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed Thursday O*** August, 2015 (MN style file v2.2) 


A blind HI Mass Function from the Arecibo Ultra-Deep 
Survey (AUDS) 

L. Hoppmann^’^*, L. Staveley-Smith^’^, W. Freudling^, M. A. Zwaan^, 

R. F. Minchin'^, M. R. Calabretta^ 

^International Centre for Radio Astronomy Research (ICRAR), MfdS, University of Western Australia, 35 Stirling Hwy, Crawley, WA, 6009, Australia 
^ARC Centre of Excellence for All-sky Astrophysics (CAASTRO) 

^European Southern Observatory (ESC), Karl-Schwarzschild-Str. 2, 85748 Garching, Germany 

‘^Arecibo Observatory, HC03 Box 53995, Arecibo, PR 00612, USA 

^CSIRO Astronomy and Space Science, PC Box 76, Epping, NSW 1710, Australia 


Thursday fi*** August, 2015 


ABSTRACT 

The Arecibo Ultra Deep Survey (AUDS) combines the unique sensitivity of the tele¬ 
scope with the wide field of the Arecibo L-band Feed Array (ALFA) to directly 
detect 21 cm H I emission from galaxies at distances beyond the local Universe 
bounded by the lower frequency limit of ALFA {z — 0.16). AUDS has collected 
700 hr of integration time in two fields with a combined area of 1.35 deg^. In this 
paper we present data from 60% of the total survey, corresponding to a sensitiv¬ 
ity level of 80 /iJy. We discuss the data reduction, the search for galaxies, paramet- 
risation, optical identification and completeness. We detect 102 galaxies in the mass 
range of log(MHi/M) — 21og/i = 5.6 — 10.3. We compute the Hi mass function 
(HIMF) at the highest redshifts so far measured. A fit of a Schechter function res¬ 
ults in a = -1.37 ±0.03, = (7.72 ±1.4) x IQ-^h^Mpc-^ and log(M^,/Mo) = 

(9.75 ± 0.041) ± 2\ogh. Using the measured HIMF, we find a cosmic Hi density of 
Hhi = (2.33 ± 0.07) X 10“^ h~^ for the sample (z = 0.065). We discuss further uncer¬ 
tainties arising from cosmic variance. Because of its depth, AUDS is the first survey 
that can determine parameters for the H I mass function in independent redshift bins 
from a single homogeneous data set. The results indicate little evolution of the co¬ 
moving mass function and Hhi within this redshift range. We calculate a weighted 
average for Hhi in the range 0 < z < 0.2, combining the results from AUDS as well 
as results from other 21 cm surveys and stacking, finding a best combined estimate of 
Hhi = (2.63 ± 0.10) X 10-4 /i-4. 

Key words: galaxies: evolution - galaxies: ISM - galaxies: luminosity function, mass 
function - radio lines: galaxies - surveys 


1 INTRODUCTION 

Understanding how and at what rate stars form from cool 
atomic and molecular gas (< lO^K) is one of the crucial 
questions of modern astrophysics. The star formation rate 
(SFR) is well measured from UV, optical, infrared and ra¬ 
dio continuum observations, and is found to increase by an 
order of magnitude over the redshift interval of 0 < 2 < 2.5 
(Hopkins & Beacom 20061. By comparison, the evolution 


of the atomic and molecular cosmic gas density appears to 
be less dramatic with recent galaxy evolution models sug¬ 
gesting that there may only be a weak evolution of cosmic 
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gas density at 2 < 2 if there exists a self-regulated equilib¬ 
rium between the inflow of gas into galaxies and the SFR, 
(Obreschkow & Rawlings 2009| Lagos et al. 2011 Power 
et al.poTO ). However, better observations are necessary to 
further develop these models and to better understand the 
balance between gas accretion, star formation and feedback. 


Measurements of the H 2 density (e.g. Keres et al.|2003' I 
are unfortunately not easy, as the molecule does not possess 
a low-energy rotational transition. Instead, we are depend¬ 
ent on the use of CO as a proxy, with its uncertain depend¬ 
ence on optical depth, metallicity and the radiation field. 
More accurate observations are available for atomic hydro¬ 
gen via Damped Lyman-a (DLA) systems at high redshift 
or via the 21 cm line in the local Universe. 
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DLAs are wide absorption features caused by high 
column densities of H l (> 2 x 10^'^atoms cm“^) normally as¬ 
sociated with galaxies. DLAs can be observed against bright 
background sources such as QSOs and appear to repres¬ 
ent objects which contain the majority of H l at redshifts 
1.6 < z < 5.0. ( Wolfe et al. poo's I and are therefore an im¬ 
portant reservoir for star formation. 

DLA measurements at z ~ 4 find values of Dh i around 
double the local value (|Zwaan et al.|2005[ ). |Prochaska et al.| 
(20051 and Prochaska & Wolfe ( 2009| find a 50% decrease to 
occur at lower redshifts 2.3 < 2 < 5.5. However, Noterdaeme 
et al. (2012) use a larger Sloan Digital Sky Survey (SDSS) 
Data Release (DR) 9 sample and only find a 20% decrease 
over a similar range. Measurements using Mgll systems at 


low-redshift DLA proxies are consistent with the latter (Rao 


et al. 20061, though selection effects are uncertain and er¬ 


rors are high. Nevertheless evolution in Dhi above redshift 
2 > 0.5 appears significantly lower than the corresponding 
evolution in the star formation rate. 

Interpretation of 21 cm observations is more robust, but 
sensitivity considerations mean that observations are mainly 
limited to the local Universe. Examples of 21 cm surveys are 
summarised in Table Extensive mapping of the sky has 
been done since the installation of multi-beam receivers on 
the Arecibo, Parkes and Effelsberg telescopes, which trans¬ 
formed these telescopes into powerful survey facilities. This 
resulted in a significant increase in the area surveyed and the 
number of galaxies detected. In terms of redshift, however, 
these surveys are still limited to 2 ~ 0. The two largest 
H I surveys are the H l Parkes All-Sky Survey (HIPASS) 
(2 < 0.04) ( Meyer et al.||200'4j Wong et al. 20061 and the 


Arecibo Legacy Fast ALFA (ALFALFA) (2 < 0.06) (Giovan- 
elli et al.|2005 f HIPASS detected 5,317 galaxies in the south¬ 


ern and the northern sky up to a declination of <5 = -1-25.5. 
ALFALFA observed an area of ~ 7000 deg^ with the target 
of detecting about 30,000 galaxies at 21 cm. 

Direct detections of galaxies beyond the local Universe 
are not only limited by sensitivity, but also by radio fre¬ 
quency interference (RFI) and receiver bandwidth. Target 
galaxies for deep 21 cm observations are therefore normally 
preselected. For example, Catinella et al. (20081 targeted 
galaxies up to 2 = 0.25 to look for the most H l-massive 
objects, selecting them by their H-a emission. |Zwaan et al.| 
(20011 and Verheijen et al. (20071 targeted clusters with red¬ 
shifts about 2 « 0.2 to increase the chance of detection. Such 
a strategy leads to samples that are biased towards galaxies 
with high surface brightness in optical bands. Furthermore, 
in order to extend beyond the local Universe, such surveys 
need to be very sensitive. With that in mind, we commenced 
the Arecibo Ultra Deep Survey (AUDS) - a blind 21 cm sur¬ 
vey with the Arecibo L-band Feed Array (ALFA) to search 
for 21 cm H l line emission at redshifts between 0 and 0.16, 
the limit of the receiver. The AUDS precursor observations 
(Freudling et al. 20111 were an important test of the feas¬ 
ibility of such a survey. The precursor survey detected 18 
galaxies in the redshift range 0.07 < 2 < 0.16 with a total 
integration time of 53 hr. While this provided a measure¬ 
ment of Dhi in good agreement with measurements in the 
local Universe, the AUDS precursor was limited to a very 
small region with few detections and covered only a limited 
range of redshifts. Small number statistics and cosmic vari¬ 
ance were therefore problems, resulting in large errorbars. 



Figure 1. Idealised relative positions of the seven independent 
ALFA beams. For AUDS the beams are rotated between 15° and 
23° to achieve a uniform coverage of the survey area. 


In this paper, we present results of 60% of the data from 
the full survey. It is a fully sampled and a significantly more 
sensitive data set than the precursor observation, providing 
a larger sample of direct 21 cm detections. The total integ¬ 
ration time is eight times larger than that of the precursor 
survey. This paper allows a preliminary release for galaxies 
so far detected, and makes significant advances in the under¬ 
standing of the evolution of the H l mass function (HIMF). 

Throughout this paper we use Ho — 
100ft kms“^ Mpc Dm = 0.3 and Da = 0.7. 


1.1 Survey Strategy 


The primary goal of AUDS is to systematically survey the 
cosmic HI density (Dh i) in a volume that is beyond the local 
Universe to a sensitivity that has not been probed before. 
The strategy of the survey is therefore orthogonal to other 
current single-dish H l surveys in the sense that AUDS covers 
a very small area on the sky using the most sensitive 21 cm 
system currently available and using a very long integration 
time. Our goal was to achieve an exposure time of about 
40 hr per pointing, and a total of 1000 hr of observing time 
(including overheads) were assigned to this project. To cover 
the field in the most uniform and sensitive way, we used 
the “drift and chase” mode which we extensively tested and 


refined during our precursor observations (Freudling et al. 
20111. The basic strategy was to carry out repeated drift 


scans over the same field. To optimise uniformity of the sky 
coverage, the feed array was rotated to ensure equal spacing 
between the beams. Due to the elliptical projection of the 
array onto the sky the rotation angle varies between 15° 
and 23°. The orientation of the array relative to the scan 
direction is shown in Figure 

One major difference with the precursor observations 
was that, with significantly more observing time available, 
we were able to Nyquist-sample the sky using short adja¬ 
cent drift scans that were offset in declination by 0.1 of the 
beam size. This enabled much more accurate determination 
of source positions (limited only by signal-to-noise ratio) and 
source fluxes. 

All observations were carried out at night time to 
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Table 1. Selection of major blind 21 cm spectral line surveys. 


Survey 

Reference 

Telescope 

N 

Area (deg^) 

Redshift 

AHISS“ 

Zwaan et al.| 11997 i 

Arecibo 

66 

65 

0-0.025 

ADBS“ 

Rosenberg!: & Schneider 12002 i 

Arecibo 

265 

430 

0-0.027 

HIPASS“ 

Meyer et al. (j2UU4 i 

Parkes 

4315 

21341 

0 - 0.042 

North. HIPASS“ 

Wong et al.l (12006 i 

WSRT 

1002 

7997 

0 - 0.042 

40% ALFALFA^ 

Haynes et al. ll201lll 

Arecibo 

10,119 

2799 

0-0.06 

ALFALFA*’ 

Giovanelli et al. (2005 i 

Arecibo 

~ 25000 

7000 

0-0.06 

AGES*’ WAPP 

Auld et al. (2006 

Arecibo 

~ 1300 

105 

0-0.06 

Mock 

CHILES*’ prec. 

Fernandez et al. l|2013 \ 

VLA 

~ 1300 

33 

95 

0.3 

0-0.16 

0-0.193 

ALFA ZOA*’ shallow 

Henning et al. 1 20lO > 

Arecibo 

~ 500 

1000 

0-0.06 

deep 

AUDS“ prec. 

Freudling et al. (20111 

Arecibo 

~ 1500 

18 

280 

0.069 

0-0.16 

0.07-0.16 


“ Observations completed and survey completely published. 

^ Observations ongoing or data not completely published - survey results not finalised. 


achieve the best baselines and lowest system temperature. 
In order to be able to schedule observations for the pro¬ 
ject throughout the year, we elected to divide the total area 
into two fields at opposite right ascension within the SDSS 
footprint. The size of the individual field areas was determ¬ 
ined by science goals and telescope limitations. A small area 
is required to go deep. On the other hand, the upper red- 
shift limit of ALFA means that once sufficient sensitivity to 
low-mass galaxies is obtained, it is then better to go wide. 
Edge effects such as loss of sensitivity make it inefficient to 
consider fields which are smaller than half a square-degree in 
size. Telescope start-up delays make it inefficient to consider 
drift scans much shorter than a degree in length. 


1.2 Target Selection and Observations 

The goal of AUDS is to carry out an unbiased, sensitive sur¬ 
vey outside the local Universe. Our original goal was to be 
able to detect galaxies with ~ of neutral hydrogen 

at the frequency limit of the ALFA receiver (/ = 1225 MHz). 
In order to sample independent volumes, we chose to observe 
two independent fields that contain no known clusters. Be¬ 
cause of the long necessary integration times, the surveyed 
fields are necessarily small ~ 67' x 44' each, corresponding 
to a total volume of U ~ 10^'®Mpc®. For efficient survey¬ 
ing, the two regions should differ in right ascension as much 
as possible, and be located at a declination where they can 
be tracked for the maximum time possible at Arecibo. An¬ 
other criterion for the selection of our fields was that they 
were within the SDSS survey region. We also tried to avoid 
bright continuum sources as much as possible. The bright¬ 
est continuum source in Field 1 is 43.9 mjy and in Field 2 
196.9 mjy. 

We used a “drift and chase” mode for the observations. 
Each AUDS scan consists of 230 individual spectra for each 
beam and polarisation, with each successive spectrum integ¬ 
rated over 1 s while the telescope covers 1 s in right ascen¬ 
sion. The spectra from the seven ALFA beams were recorded 
using the Mock Spectrometer dividing each spectra into two 
intermediate frequency (IF) sub-bands, each 172 MHz wide. 
The high frequency IF is centred at / = 1450 MHz and the 
low frequency IF is centred at / = 1300 MHz. Combined 
they cover the whole bandpass range of the ALFA receiver 
of 300 MHz with a spectral resolution of A/ = 0.02 MHz. 
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1.3 Data Processing 


The bandpass removal and calibration was done using the 
multi-beam single-dish data reduction software livedat^ 
Details of the steps of the reduction process are described in 


Barnes et al. (20011. As livedata was originally developed 


to reduce data from the Parkes telescope for HIPASS, the 
program was adapted to suit the different settings of the 
Arecibo telescope including the handling of different types 
of FITS files and other Arecibo specific issues such as calib¬ 
ration. 

Each individual spectrum with 1 s integration time has 
a root mean square (RMS) of about 50 mJy. To convert 
the individual spectra into regular gridded position-position- 
velocity cubes we use the software gridzilla^. For the final 
gridded data cubes, data from all beams and polarisations 
were combined. 

Gridzilla calculates the contribution of every indi¬ 
vidual spectrum to each pixel of the grid and calculates the 
final value of the pixel based on the contributing spectra 
and the assigned weights. The weights were determined by 
the distance of spectra from the beam centre. We used a 
weighted median statistic to combine the spectra. The data 
values were first sorted, and their weights were summed. 
The weighted median is then the data value for which the 
sum of the weights is half the total weights. [Barnes et al.| 
(20011 have shown that using a median estimator was very 


successful for HIPASS in removing small amounts (less then 
40%) of bad data (caused by RFI etc) with the downside of 
increasing the noise level by at least 25.3%. 

However, on occasion, the AUDS data show much more 
significant levels of bad data (Figure]^, especially in the 
frequency range of / = 1220 — 1350 MHz, necessitating fur¬ 
ther measures. Several sources of strong RFI, like the radar 
in Punta Salinas and transmissions from the nearby airport 
interfere in this frequency range. Additionally, several satel¬ 
lites (e.g. GPS, Galileo and GLONASS) transmit in this 
frequency range. While the RFI emitted from the radar are 
narrow in frequency and pulsed in time, the RFI from the 
satellites span a wide frequency range, but do not occur at 
all times. 

To further mitigate against the RFI problem, we used 


^ http://www.atnf.csiro.au/computing/software/livedata 
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Frequency / [MHz] 


Figure 2. RFI occupancy plot for AUDS. While the higher fre¬ 
quency regions are only mildly affected by RFI, large regions of 
the lower frequency ranges are completely wiped out. 


a flagging routine applied to the bandpass-corrected data as 
follows: 

(i) The RMS in a RFI-free region is estimated. The RFI 
free region was chosen by examination of several time- 
frequency images. 

(ii) A 3.5cr mask is created where zero corresponds to 
valid pixels and one to masked pixels. 

(iii) The mask is grown by smoothing it with a Hanning 
kernel of the size 5x5 pixel (time-frequency domain) which 
results In values between zero and one. Pixels where the val¬ 
ues of the mask is above 0.1 are flagged and ignored during 
the gridding of the data. 

This process is repeated three times before the data is grid- 
ded into data cubes. An example of the mask and flagging 
is shown in Figure 


2 AUDS SAMPLE 
2.1 Galaxy Catalogue 

To search for galaxies, position-velocity images were 
searched by eye to create a list of galaxy candidates. Due to 
the varying noise and strong RFI at higher redshift, avail¬ 
able automated source finders were not able to create a 
usefully short candidate list. To create the source list the 
cubes were searched by three members of the AUDS team. 
Each tried to identify every possible candidate, even those 
that were unlikely to be significant detections. Together, 294 
unique candidates were identified and a preliminary source 
list was created. In a second step, a single person inspected 
each spectrum and image in position-velocity space from the 
merged list. Spectra were extracted and fitted and a shorter 
source list created based on the final inspection of each can¬ 
didate. It was important to carefully distinguish between 
real galaxies and spurious detections based on their signal- 
to-noise ratio (SNR), their spectral line shape and the shape 


of the detections in the image plane. Special attention was 
taken to distinguish between real sources and RFI. In many 
cases a clear distinction was possible given their very dif¬ 
ferent signatures in the position-velocity plane, RFI often 
being narrow in frequency and visible over large ranges in 
right ascension/declination. However, at low SNR, the dis¬ 
tinction is not as clear. Candidates close to, or overlapping 
with known RFI were treated especially carefully to ensure 
they were real and that the bandpass was fitted correctly. 
This resulted in a shorter list of 133 likely candidates, five of 
which were common between the overlapping bands leaving 
128 candidates. 

We then used the completeness coefficient C, derived 
from the simulation discussed in Section |2.3| to create a 
final source list. C uses the integrated flux Sint, the velocity 
width W and the RMS cr, of each galaxy to calculate the 
probability of detecting a galaxy in the survey. If C > 0 the 
galaxy was retained in the sample. 

The final sample consists of 102 galaxies. A selection 
of spectra of AUDS galaxies is presented in Figure]^ Para- 
meterisation of the galaxies was performed using the task 
mbspect which is par t of the radio astr onomy data reduc¬ 
tion package miriad ( Sault et al.||l995| p] The galaxy posi¬ 
tion estimated from the manual search was more precisely 
determined by fitting a Gaussian to the velocity integrated 
image (0th moment) over a width of 5-9 arcmin around the 
input position assuming the galaxies are point sources. This 
is a good approximation as the beam size at the mean red- 
shift of the galaxies in the sample {z = 0.065) is about 
190kpc in diameter. Additionally we also looked at the 
optical diameter (Petrosian diameter in the r-band) of the 
AUDS galaxies we could cross match to SDSS galaxies (de¬ 


tails are given in Section 2.2 ) and found that 95% of galaxies 
are smaller than 1.2 arcmin. This would allow the Hi disk 
to exceed the optical diameter by up to three times and still 
be within the ALFA beam. 

Next the spectra were optimally extracted, also using 
the miriad function mbspect, at the new position using a 
window of 5-9 arcmin, weighting neighbouring pixels by the 
beam shape. The velocity range occupied by the detected 
galaxy was masked out, before a polynomial was fitted to the 
baseline. The spectral width W and the central velocity of 
the profiles were measured. The fit also provided Sint, Speak 
and the peak SNR. The fluxes were measured assuming the 
galaxies are point sources. Using Sint and the luminosity 
distance Dl of the galaxies we calculate Mh i 


^ = 49.8 (^ 
Mq \ Mpc 


Si, 


JyHz 


( 1 ) 


The H I masses of the AUDS galaxies as a function of their 
distance is presented in Figure 

The bivariant distributions as well as single parameter 
histograms of the galaxy parameters ( 2 , Sint, Speak, W) are 
shown in Figure]^ The redshift histogram shows that we did 
not detect any galaxies with 2 > 0.13. Detecting H l galaxies 
at high redshift proved challenging as the RFI environment 
at Arecibo is very hostile, especially for / < 1290 MHz. 


^ http://www.atnf.csiro.au/computing/software/miriad/ 
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Figure 3. The three images show an example of the RFI flagging process. Upper panel: Example of data from one beam after bandpass 
removal and calibration using livedata. A stack of 230 spectra is shown with 1 s integration time each. Centre panel: The 3a mask created 
in the flagging process. The black regions will be masked out. Lower panel: Data after masking with significantly reduced RFI. 


2.2 Optical Counterparts 

Both AUDS fields were chosen to overlap with the SDSS 
footprint. Searching the SDSS DR7 catalogue we find Field 
1 has 25 galaxies and Field 2 has 56 galaxies with spectro¬ 
scopic redshifts in the redshift range of AUDS. Additionally 
there are 11588 galaxies with only photometric redshifts in 
Field 1 and 9932 galaxies in Field 2. 

We used the SDSS to find optical counterparts for the 
AUDS galaxies and found that 36 out of 102 have spectro¬ 
scopically confirmed counterparts and that at least one of 
the AUDS galaxies is a pair. Optical galaxies are identi¬ 
fied as matches when they are within a two arcmin radius 
around the H l position and the difference with the spectro¬ 
scopic redshift is smaller than 150kms“^. Taking the pos¬ 
ition (inside the beam), the size, inclination and colour of 
the galaxy into account we identify another 18 likely optical 
counterparts. Figure]^ (left panel) shows the offset between 
the position of the optical counterparts as given by the SDSS 
and the positions measured from the H l data for both the 
galaxies with optical counterparts with spectroscopic red¬ 
shifts as well as the galaxies with likely counterparts with 


photometric redshifts. We also show the difference between 
the optical redshift and H l redshift in the right panel of the 
same figure. 

Even though the overall number of galaxies with re¬ 
liable optical information is small, some trends are clearly 
visible (Figure]^. There is a correlation between the detect¬ 
ability of a galaxy and its r-band luminosity as the SDSS is 
a magnitude but not volume-limited sample. This also be¬ 
comes evident when one compares the luminosity of the op¬ 
tical counterparts with and without spectroscopic redshifts. 
Galaxies with higher masses at a certain redshift are more 
likely to have spectroscopic counterparts. 

The second trend is that lower mass galaxies are pre¬ 
dominately blue galaxies (g — r< 0.7) and less likely to have 
spectroscopic information. Redder galaxies are only found at 
large Mh i- This is expected as bluer galaxies tend to be more 
gas rich. Red galaxies on the other hand tend to have lower 
gas mass fractions but are significantly more massive and lu¬ 
minous. That makes them easier to detect in optical surveys 
especially at higher redshift where the survey volume is lar¬ 
ger. This means that non-volume limited H l surveys tend 
to be biased against red galaxies with increasing redshift. 
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Figure 4. Selection of AUDS spectra. The black dashed vertical lines indicate the spectral range in which the line is fitted. This spectral 
region is excluded for the fit of the baseline, marked with the blue dashed line. The completeness coefficient for each galaxy is given in 
the upper right hand corner. The top panels show the three lowest mass galaxies detected in the survey. The central panels shows three 
galaxies at intermediate redshift with high SNR. The lowest panels are the three highest redshift galaxies. 
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Figure 6 . Log-log bivariate distribution of measured parameters of AUDS galaxies: redshift 2 , 50% velocity width W 50 , peak flux Speak 
and integrated flux Sint- On the diagonal the single-parameter histograms are plotted. 
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[arcmin] 



Figure 7. Left panel: Black dots: AUDS galaxies with spectroscopic counterparts, Red diamonds: likely optical counterparts with only 
photometric redshifts. The plot shows the difference between the source position fitted to the AUDS galaxies and the source position 
given in the SDSS catalogue. The grey shaded region is the size of the ALFA beam. Right panel: Histogram of the difference in redshift 
between the measured optical velocity from SDSS and the AUDS for all galaxies with spectroscopic counterparts. 




log Mhi [h-^ Mq] 


Figure 8 . Optical counterparts of AUDS galaxies. Black dots: H i-selected AUDS galaxies with optical spectroscopic counterparts; red 
diamonds: likely candidates with only photometric redshift measurements; cyan stars: galaxies in a pair. Left plot: AUDS is not volume 
limited and therefore we find a correlation between the redshift and the detected luminosity of the galaxy. Less luminous galaxies (at 
a certain redshift) are less likely to have spectroscopic counterparts. Right plot: Lower mass galaxies are predominantly blue galaxies 
{g — r < 0.7). Redder galaxies are only found at larger Mhj. As bluer galaxies tend to be more gas rich they are more easily picked up 
at low masses in comparison to red galaxies which have lower gas mass fractions but are significantly more massive and luminous and 
therefore easier detectable at higher redshifts by optical surveys. 


We leave the more detailed discussion of optical properties 
as well as a stacking analysis for a later paper. 


2.3 Survey Completeness 

The completeness of a survey is defined as the fraction of 
a certain type of galaxy which can be detected by a survey 
from the underlying distribution of objects down to the de¬ 
tection limit of the survey. Estimating the completeness of 
AUDS is a crucial step to understand the underlying galaxy 
population. 

A good way to determine the completeness of the 
sample is to insert synthetic galaxies into the data and test 


the rate with which these galaxies are detected. This al¬ 
lows us to determine a completeness limit as a function of 
5'int as well as W and the single pixel noise a. To take the 
noise into account we defined a noise weighted integrated 
flux S'int = Sint /o'. 

To insert the galaxies we chose three representative 
subregions in the data cube of Field 1: one RFI free re¬ 
gion (1381MHz); one region moderately affected by RFI 
(1325 MHz); and one region with high RFI occupancy 
(1231MHz). Each region is 20MHz wide. In total 356 syn¬ 
thetic galaxies were inserted into the subcubes. The para¬ 
meters for the synthetic galaxies were randomly chosen from 
an integrated flux range of Sint = 0 to 1.5mJyMHz and a 
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Figure 9. ID-slices of the completeness function in different bins of frequency width. The central width of each bin is given in the upper 
left hand corner. The completeness is well described by an error function shifted along the horizontal axis. The grey shaded area indicates 
the completeness for the maximum and minimum width in the bin. Left panel: The completeness as a function of Sint and W. The dashed 
line is the best-fit to the data using equation]^ The red dashed line indicates a nominal completeness limit of Sint = O.SmJy MHz. Right 
panel: The completeness as a function of and W. The dashed line is the best-fit to the data using equation 


width range of VF = 0.12 to 2.0 MHz. Varying the width 
of the galaxies takes both variation due to different rota¬ 
tional velocities as well as different inclination into account. 
The shapes of the galaxy spectra were described by a busy 
function ( [Westmeier et al.||2014[ ). 

The data cubes were searched blindly in the same way 
as the original data cubes and 256 of the galaxies in the mock 
sample could be recovered. From the ratio of the number of 
detected galaxies (rid) to the number of all galaxies (rit) the 
completeness in logarithmic flux and linear W bins can be 
computed. 

The dependency of the completeness on W and S'int as 
well as W and ,shown in Figure resembles an error 

function shifted along the flux axis. We chose to describe the 
completeness function with the following analytic model 


C = max erf aH7“(S'int - /3W - 7 ) , 0 


( 2 ) 


C = max ^erf (ST^t - /3 W - 7 ) j , 0 j . (3) 

The four independent parameters in equation and were 
chosen to fit the features in the binned completeness data 
namely: ( 1 ) the completeness decreases for smaller fluxes for 
all values of W; (2) W influences the steepness of the de¬ 
crease of the completeness to zero (narrow profiles are easier 
to detect) and (3) the flux value at which the completeness 
essentially becomes zero. 

The errorbars in Figure [^indicate the 1 a binomial con¬ 
fidence interval given by nu/nt ± ^ (p(l — p)/nt) = p with 
rid being the number of detected synthetic galaxies in a bin, 
rit the number of all synthetic galaxies in that bin and p the 
upper/lower limit of the confidence interval. Binning data in 
intervals of ASint = O.lmJyMHz and AW — 0.4 MHz and 


Table 2. This table summarises the integrated fluxes at which a 
galaxy with a certain width reaches a completeness limit of 95%, 
90% and 65%. 


Width 

(MHz) 

Width {z = 0) 
(km/s) 


Sint 

(mJy MHz) 



C = 0.95 

C = 0.90 C 

= 0.65 

0.22 

46 

0.76 

0.49 

0.18 

0.62 

131 

0.81 

0.58 

0.28 

1.02 

215 

0.87 

0.65 

0.35 

1.42 

300 

0.93 

0.71 

0.40 

1.82 

384 

0.98 

0.77 

0.45 


fitting equation!^ gives the best-fit parameter of a = 28.95, 
P = —0.08, 7 = —5.21 and 5 = 0.29. Using the noise 
weighted flux (ASjnt = 0.2, AW = 0.4 MHz) and fit¬ 
ting equation gives the best-fit parameter of a? = 68.39, 
P = 0.45, 7 = —0.52 and S = 0.030. Since the noise changes 
throughout the cube, we found that equation]^ is the better 
way to minimise the effect of the different noise levels and 
we therefore used C{S*^tyW) when calculating the HIMF. 
For each galaxy in the sample we calculate the complete¬ 
ness coefficient by inserting the value for 5int, o and W 
into equation giving us a specific value for that galaxy 
which represents the probability of detecting this galaxy 
in the sample. Additionally we give the integrated flux at 
which a galaxy of a certain width reaches a completeness 
of C = 95%, 90%, 65% in Table As before, only galax¬ 
ies with completeness coefficients > 0 have been accepted 
in the sample (see Section [ 23 . Some of the galaxies with 
a completeness coefficient = 0 may in fact be real, but it is 
not possible to use them for statistical studies. 
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Figure 5. Top panel: H i masses of the AUDS galaxies as a func¬ 
tion of distance. The red dashed line indicates the minimal detect¬ 
able H I mass as a function of distance, assuming an integrated 
flux limit of 5int = O.SmJyMHz. Bottom panel: The black, solid 
histogram shows the distribution of detected galaxies. The red 
dashed line is the expected galaxy distribution derived by mul¬ 
tiplying the 2DSWML selection function by flD^ ADn. The blue 
dash-dotted line is the prediction using the selection function from 
the 1/V'max method. 



Figure 10. Comparison between the normalised numbers of 
galaxies in the ~ 4.2 deg^ sized SDSS fields SI and S2 (surround¬ 
ing to two AUDS fields) and the representative SDSS field with 
a total area of 5150deg^ (LI). Top panel: Comparison between 
Field 1 (blue line) and SDSS (black line), Centre panel: Compar¬ 
ison between Field 2 (green line) and SDSS (black line). Bottom 
panel: the relative density ratio for fields SI and S2 compared to 
the representative field LI. 


2.4 Reliability 


One method of estimating the reliability of an H l survey is 
to use optical information provided by other large surveys, 
e.g. SDSS. Another possibility is to re-observe parts of the 
survey area to assess the reliability of their sources and the 
measured parameters, done for example by HIPASS ( |Zwaan| 
et al.|[2004 |. However, neither of these methods are feasible 
for AUDS. The spectral density of SDSS in the area of the 
AUDS fields is too low to systematically cross-match all the 
optical and H l selected galaxies (Section |2.2[ ) and using ad¬ 
ditional telescope time for re-observations is not practical. 

As a first step to estimate the reliability we therefore 
looked at the overlap regions between the low and the high 
frequency bands. The cubes overlap in the frequency range of 
1368— 1382 MHz. In this overlapping region we find 5 galax¬ 
ies which are individually detected in both the high and the 
low frequency bands. There are two additional galaxies in 
the source list which are only detected in the high frequency 
bands. However, both these galaxies have a completeness 
coefficient C < 0.3. 

Next we estimated the reliability of our survey re-using 
the list of possible detections created from the data cubes 


with inserted synthetic objects (see Section 2.31. Of the 
total of 330 detected sources, 256 turned out to be syn¬ 
thetic, and 12 were previously detected AUDS galaxies. Of 
the 62 unidentified source candidates, about 10 passed the 
criteria for being included in our first cut candidate list as 
described in Section |2.1[ None of them passed our criteria 
to be included in the final catalog, otherwise they would be 


considered as one of the detected AUDS galaxies. We find 
that the number of such false detections is roughly propor¬ 
tional to the number of confirmed detections in any of the 
volumes. Therefore, we conclude that first cut catalogues 
contain on the order of 10/256 4% false detections, and 

the number of false detections in the final catalogue will be 
lower than that. We therefore consider 96% as a lower limit 
for the reliability of AUDS. The impact of false detections 
in AUDS is negligible for all results presented in subsequent 
sections. 


2.5 Cosmic Variance 

Measurements of the galaxy density in a finite volume are 
affected by the large scale structure of the Universe causing 
a bias in measurements in small volumes like AUDS. The 
sample variance originating from a finite volume is called 
cosmic variance. In order to quantify the cosmic variance 
of the density measurement, a quantity ^ can be defined as 
^[%] = 100 X (Tvar / {N) with the variance cr^ar = — 

Nif^/n with {N) being the mean galaxy count in the selected 
volumes, W the number of galaxies in the volume i, and n 
the total number of selected volumes. 

To estimate ^ in our sample, we selected 100 random 
fields in the SDSS North Galactic Pole field (using DR7) 
with the same size and redshift range as one of the AUDS 
fields. We then calculated the mean and standard deviation 
of the number of galaxies in these fields. To estimate the 
sampling error in ^ and Uvar we repeated that procedure 
1000 times. The number of galaxies per random field var- 
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Table 3. To correct our HIMF for the effect of cosmic variance 
we calculated the relative density in redshift bins for both AUDS 
fields individually. We compared the number of galaxies per deg^ 
in redshift bins in the small fields (S1,S2) around the AUDS 
fields and a large field (Al) to detect over- or under-dense regions 
in SI and S2. We derive the density ratio ps/PL of the AUDS 
field (Field 1, Field 2) in relation to a representative SDSS galaxy 
sample and its dependence on the redshift. 



Z 

PSi/PLl 

PSa/PLl 

0.0 - 

- 0.02 

0.316 

3.754 

0.02 

- 0.04 

1.004 

0.264 

0.04 

- 0.06 

1.835 

0.363 

0.06 

- 0.08 

0.147 

0.249 

0.08 

- 0.10 

0.430 

3.216 

0.10 

- 0.12 

0.453 

1.422 

0.12 

- 0.14 

0.439 

0.424 

0.14 

- 0.16 

0.230 

0.380 

0.16 

- 0.18 

0.350 

1.783 


ied between 10 and 335 galaxies with an average number of 
54(±2) ± 23(±3) galaxies or a sample variance of ^ = 42±5 
%. Doing the same test, selecting two AUDS sized fields as 
in the survey reduces the cosmic variance to ^ = 29 ± 4%. 
The decrease in cosmic variance corresponds to a reduction 
of y/2. Comparing these results to the information from our 
helds, we find that our fields are slightly denser than the 
average field of the same size in SDSS. We Hnd 58 galax¬ 
ies in Field 1 (1.1 times over-dense) 60 galaxies (1.2 times 
over-dense) in Field 2. 

We compare our results to those presented by |Driver| 


&: Robotham (20101. They derived an empirical expression 


for 5 for galaxy surveys using the spectroscopic redshifts 
of galaxies from the DR7 SDSS in the redshift range of 
0.03 < z < 0.1. They found that ^ is mainly influenced 
by the survey volume, the survey aspect ratio and whether 
the survey area is contiguous or consists of several independ¬ 
ent volumes. Using their equation we find ^ = 33% for both 
helds, or ^ = 46% which is within Icr of our result. 

The sample variation in the AUDS helds suggest that 
any density result will be correspondingly biased. However, 
a correction is possible as follows: 

(1) We selected a large sub-sample from the spectro¬ 
scopic DR7 SDSS galaxy sample (LI). LI is in the main 
SDSS held (130° < a < 236°, 0° < <5 < 58°) selected to 
maximise the area (5150 deg^) but to avoid the complex 
shape of the edges of SDSS (see Driver & Robotham 2010). 
Even though SDSS is not completely immune to cosmic vari¬ 
ance itself, LI is large enough that the expected difference 
between the mean density of LI and that of the Universe is 
~ 7% based on the results of |Driver &: Robothamj ( |2010[ ). 

(2) We selected all SDSS galaxies in small helds sur¬ 
rounding each of the AUDS helds. We name these smaller 
SDSS helds SI and S2. The area of SI and S2 needs to be 
larger than the area of the original AUDS helds to reduce 
Poisson noise due to the small number of galaxy counts, but 
small enough such that their density remains correlated with 
the density of galaxies in the AUDS helds. 

To hnd the optimum size of SI and S2, we placed 100 
AUDS-like volumes at random positions within SDSS. We 
then computed the average galaxy density of each AUDS-like 
volume. Around each of these 100 AUDS like helds we placed 
another held of a larger size. For each held, we calculated 


the ratio of the density of the AUDS sized held to the larger 
held surrounding it. Repeating this procedure for differently 
sized held 250 times, we found that the standard deviation 
of the density ratio is minimised (at 11%) when the size of 
the surrounding held is 4.2 deg^. That is, the quadrature 
sum of Poisson noise and cosmic variance are lowest for this 
held size. Furthermore, the density ratio itself indicates that 
S'! and S2 are representative of the structure in the smaller 
held. 

(3) We compared the number of galaxies per area in 
redshift bins in the small helds (SI, S2) and the large held 
(LI) to detect over- or under-dense regions in SI and S2 
(Figure [To{ Table |^. It is important to note that we make 
the assumption that the optically selected helds (S1,S2 and 
LI) have the same distribution of galaxies as the H l selec¬ 
ted AUDS sample. Looking at the respective redshift bins 
for each held we hnd the relative density of the small helds 
to vary between being 4.3 times under-dense and 3.8 times 
over-dense in comparison to the representative SDSS held 
(LI). Tracing the relative density in the AUDS helds in red¬ 
shift bins allows us to correct for the effect in the HIMF as 
described in Section (3.II 

For this correction we make the assumption that optical 
data and the H l data correlate. To test this assumption we 
compare the bias factor for the SDSS helds (51, 52, LI) 
with derived bias parameters for H l selected surveys. |Seljak| 


et al. (20051 measured the bias parameter for optical galaxies 


as a function of luminosity. Using their bias factors we hnd 
that the average bias factor of the galaxies in the 4.2 deg^ 
regions around the AUDS helds are: {bsi,S 2 ) = 0.97 ± 0.11 
while the LI held has an average bias of (fcii) = 0.99±0.06. 
Observations as well as numerical simulations estimate local 
bias parameters of the neutral hydrogen relativ to the dark 
matter between 0.7 and 1.0, with a typical uncertainty 
of±0.2 (Basilakos et al. 2007, Martin et al. 2012, Dave et 
al. 2013, see also Padmanabhan et al. 2015 for review). This 
shows that the clustering between optical selected galaxies 
and dark matter, and H l selected galaxies and dark matter 
are very similar on the spatial scales probed here, there¬ 
fore allow to use the optical data to correct for over- /under¬ 
densities in the H l selected galaxies. 


3 H I MASS FUNCTION (HIMF) 

3.1 Methods 

The HIMF 4?(Mm) is a measure of the number of galaxies 
per unit volume dUfor a given Mh i and is crucial input para¬ 
meter for models and simulations describing galaxy forma¬ 
tion and evolution. We derived the HIMF in co-moving co¬ 
ordinates to avoid changes in the measured densities purely 
caused by the expansion of the Universe. 

The HIMF is often parameterised by a Schechter func¬ 
tion dehned as: 

( \ a+i 

ALtr \ “h I 

^ e~^, (4) 

with the faint end slope a, the characteristic mass Mj) i and 
the normalisation >1>*. 

We use two different methods to derive the HIMF: The 
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Table 4. Comparison of different results for the HIMF from 21 cm surveys. 


Survey 



Reference 


a 

log (M*,/Mo) 
±2 log h 


AHISS 



Zwaan et al. 

19971 

- 1.2 

9.55 

14 

ADBS 

Rose 

nberg & Schn 

eider 

12002 

1 - 1.53 

9.63 

11.9 



Springob et al 

|2oA 

5 ' 

-1.24 

9.68 

9.3 

HIPASS 



Zwaan et al. 

loos 


-1.37 ± 0.03 

9.55 ±0.02 

14.2 ± 1.9 

ALFALFA 



Martin et al. 

(20lL 

1 

-1.33 ±0.02 

9.71 ± 0.01 

14.0 ±0.9 

AUDS 



This work 


-1.37 ± 0.03 

9.75 ±0.041 

7.72 ± 1.4 


1/^max method ( Schmidt] 19681 and the 2D stepwise 
maximum likelihood (SWML) ( Zwaan et al.|[2003 K 

The basic 1/llnax method assigns each galaxy a weight¬ 
ing factor which corresponds to the inverse maximum 
volume (Vliax) in which a galaxy can be detected inside the 
survey volume. 

We adapt this method to compute the maximum search 
volume using the relation for completeness in equation 
The completeness for a galaxy changes if the galaxy is shifted 
to a different part of the cube as the noise changes within 
the field and with frequency. 

We therefore create two additional data cubes of the 
same size as the original cubes. In the first one, each pixel 
value corresponds to the co-moving volume in Mpc^ corres¬ 
ponding to that pixel. In the second data cube each pixel 
value corresponds to the RMS noise of neighbouring pixels. 
The noise is computed by calculating the RMS noise in a 
±250km/s range around that pixel. For each galaxy, we 
computed the expected completeness Ci if that galaxy were 
placed at any of the pixels within the survey volume using 
equation]^ For that purpose, we scaled Sint and W to the 
distance corresponding to each pixel, and used the RMS of 
that pixel in the noise cube. The effective volume per pixel 
is then the product of completeness and volume Vi for each 
galaxy. 

The volume in which a given galaxy is detectable, here¬ 
after called the “detectable volume” is then the sum over the 
effective volume of each pixel 


Knax = Vi. 


(5) 


The co-moving HIMF is then defined as the sum over the 
Mhi range of all galaxies j in a Mhi bin with the bin size 
AMhi 

<I>(Mh.) = Ei7^- (6 ) 

V^max,? 

3 

In addition to the volume cube, we also created a 
cosmic-variance-corrected volume cube. For the cosmic- 
variance-corrected cube we multiplied each volume pixel in 
a redshift bin with the relative density of that redshift bin 
found in the optical sample (Table: 

Fmax,Cv[ai,22] = ^^^^^Rmax[2l,«2]. (7) 

Pl[zi,z\ 

That means that the volume of a pixel in an under-dense 
region is “shrunk” while the volume of a pixel in an over- 
dense regions is “enlarged”. The X] l/Knax method has the 
advantage of being fast and simple to implement as well as 
producing a normalised HIMF. 

The second method to calculate the HIMF is the step¬ 
wise maximum likelihood method developed by |Efstathiou| 



log Mhi [h-" M^] 


Figure 11. Top panel: The AUDS co-moving Hi mass func¬ 
tion measured using the Sl/Fmax method (black triangles) 
and Sl/VkiaxCV method (red diamonds), with their corres¬ 
ponding best-fit Schechter functions. The errorbars indicate the 
la uncertainties based on Poisson statistics. The black dashed 
line is the best-fit Schechter function for the Sl/Vkiax with 
a = -1.53 ±0.04, <!>* = (3.52 ± 0.80) x Mpc-^ and 

log(M^ ,/Mq) = (9.87 ± 0.02) ± 2 log h. The red dash-dotted line 
is the best-fit Schechter function for the Sl/Vlnax,CV method 
with a = -1.37 ± 0.03, = (7.72 ± 1.4) x lO'^ 

log(M^,/MQ) = (9.75 ± 0.041)±2logh. Lower panel: Histogram 
of H I masses, using the same binning as in the top panel. 


et al. (1988 \ as a superior tool to derive the luminosity func¬ 


tions of galaxies. The idea behind the SWML technique is 
to find the function <I>(Mh i) that yields maximal joint prob¬ 
ability of detecting all galaxies in the sample. It has the 
advantage over the 1/Knax method because the results are 
independent variables of density. In the SWML method the 
galaxy mass distribution is split into bins assuming a con¬ 
stant distribution in each bin. It is not necessary to assume 
a functional form for the HIMF. Based on SWML method. 


Zwaan et al. (20031 developed the 2 dimensional - SWML 


method which solves for the space density of Mu , and W at 
the same time. 
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Figure 12. Comparison between the HI mass functions derived 
with the Sl/VjnaxCV niethod (red diamonds and dashed line) 
and with the 2DSWL method (blue dots and dash- dotted line, 
respectively). The lines indicate the best-fit Schechter functions. 


3.2 E 


HIMF 


Figure shows the results of the HIMF calculation using 
the y] 1/Fmax method. In the following we will exclude the 
lowest mass data point in our fit as it only contains one 
galaxy with a low completeness coefficient (see Figure [^up¬ 
per left panel). Nevertheless, this galaxy and the two in the 
next bin are suggestive of a possible upturn in the HIMF . 

The errorbars were calculated using Poisson statist¬ 
ics not taking into account measurement errors (e.g. dis¬ 
tance and Mil,). The best Schechter fit to the model 
without correction for cosmic variance is a = —1.53 ± 0.04, 
log(MH,/M 0 ) = (3.52 ± 0.80) -I- 21ogh and <I>* = 

(3.52 ± 0.80) X 10“^ h~^ Mpc“®. The errors in the Schechter 
function are computed using a jackknife technique, de¬ 
selecting one galaxy at a time ( Quenouille|1949 1. 

Next we use the cosmic-variance-corrected volumes 
(Vmax.cv) to compute the HIMF (Figure 111. Both meas¬ 


urements show good agreement within the errorbars. The 
result for the best-fit to the cosmic variance corrected data is 
a = -1.37 ± 0.03, \og{M^JMQ) = (9.75 ± 0.041) -b 2 log h 
and <E>* = (7.72 ± 1.4) x 10"® Mpc“^ 

We binned the data in 0.5 dex bins of Mhi starting at 
an H I mass of 5.5. Different binning and starting masses 
give slightly different mass functions and fits. We estimated 
the error caused by this by varying the starting mass in the 
range of log(MH i/Mq) = (5 — 5.5) + 2 log h and the bin size 
by 2-5 bins per dex in Mui- The RMS over the best-fits was 
comparable to the fitting error {ac = 0.04, = 0.10 

and (Tj> = 4.6 x 10~^ Mpc“^ ). 


3.3 2DSWML HIMF 

The 2DSWML method is independent of density variations 
due to large scale structure and therefore a good, independ¬ 
ent test to check the results from the E l/Fmax method. It 
does, however, rely on the shape of the HIMF not changing. 
Furthermore, the difference in the noise at different locations 
within the survey volume cannot be taken into account. In¬ 


stead, we used a single value for the RMS for each detected 
galaxy. 

We recovered the normalisation for the HIMF using 
the mean galaxy density n. n is calculated by correct¬ 
ing the measured distribution of galaxies with the selec¬ 
tion function S{D) (Section |3. 3.1 1 . Davis & Huchra (19821 
presented several estimators to calculate h. We choose the 
Us = Ntotai/ / S'(D)d’F estimator as it is the most stable 
one for small numbers even though it has a slight depend¬ 
ency on large scale structure. 

Figure compares the 2DSWML HIMF and the 
El/V'max,cv HIMF. We find that the data points for both 
methods are in reasonably good agreement at the high mass 
end (Mhi > lO^'^M©) with each other. At lower masses 
the slope of the 2DSWML (red dashed line) is less steep. 
Even though most of the data points agree with each other 
within the Icr errorbar the 2DSWML HIMF points are sys¬ 
tematically lower causing a significant difference in the fit¬ 
ted slope. The best-fit to the 2DSWML HIMF function 
yields = (9.82±5.40) x 10~® h® Mpc“^ log(MH,/M 0 ) = 
(9.70±0.07) 4-2 log h and a = —1.14±0.10. The good agree¬ 
ment between the normalisation of the 2DSWML and the 
E 1/Vjnax,cv HIMF is encouraging, as the normalisation for 
the 2DSWML HIMF was calculated independently. 


3.3.1 Selection Function 


The selection function S{D) is the probability that a galaxy 
at a distance D is detected by the survey. We calculate S{D) 
for the 2DSWML and the 1/Knax HIMF as described in 


Zwaan et al. (20031. 


Assuming a homogeneous space distribution of galaxies, 
the number of galaxies observed in a distance bin of the 
size AD at the distance D is n{D) = Q. DAD n S{D) with 
the solid angle Q. and the average number of galaxies n. 
Figure compares the detected and the predicted redshift 
distribution showing that they are not in good agreement. 

The galaxy numbers derived by the selection function 
give lower numbers of nearby galaxies while overestimating 
galaxies at larger distances in comparison to the detected 
galaxy distribution. The effects could be caused by large 
scale structure which are not traced by the selection func¬ 
tion. A good example is the over-density at D « 270 Mpc in 
the AUDS histogram which does not show up in the predic¬ 
tion. A possible explanation for the over-estimation of high 
redshift galaxies lies in the general limitation of AUDS to 
pick up galaxies at high redshift largely caused by RFI, as 
described in Section (2. II 


3.4 Influence of Completeness 


We rate the quality of our detected galaxies by their com¬ 
pleteness coefficient C and only include galaxies with C > 0, 
excluding the lowest mass galaxy for the HIMF fit. In Fig- 


13 we compare the 1/Unax,cv HIMF for different cuts 


in the completeness coefficient {C = 0.4,0.5, 0.6). The plot 
shows the lower mass galaxies (M < are excluded 

first causing the slope to flatten. High completeness cut¬ 
offs (C > 0.5) exclude galaxies with masses around the 
knee of the HIMF resulting in smaller values for the nor¬ 
malisation I>*. The change in the HIMF with completeness 
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Table 5. Comparison between the best Schechter fit as well as the values for for the derived AUDS HIMF. 


Methods 

a 

log(M*,/M 0 ) 
+2 log h 

(10“^ Mpc“^) 

Hhi (int-) 
(10-4 h-i) 

r^Hi (sum.) 

H 1/Umax 

S 1/Unax, CV 
2DSWML 

-1.53 ±0.04 
-1.37 ± 0.03 
-1.14± 0.10 

9.87 ± 0.02 
9.75 ±0.041 
9.70 ±0.07 

3.52 ± 0.80 

7.72 ± 1.4 

9.82 ± 5.40 

1.76 ±0.09 
2.22 ±0.10 

1.96 ±0.51 

1.63 ±0.08 
2.33 ±0.07 
1.94 ± 0.49 



Figure 13. The l/\4aax,CV HIMF calculated for galaxies selected with different completeness cutoffs. The completeness equation ^ 
was used to measure the completeness coefficient C for each galaxy. Plots for C > 0, 0.4,0.5 and 0.6 are shown. The corresponding 
numbers of galaxies are 102, 98, 96 and 92. The Schechter fits are shown as the black dotted lines with the result of the fitted slope 
in the top right corner of each panel. The red solid line is the Schechter fit from Figure ini based on all galaxies with C > 0. The red 
histograms show the HI mass distribution of the full AUDS sample and the black histograms are the corresponding distributions for the 
restricted samples. 


is an intriguing result as unlike HIPASS and ALFALFA, the 
low-mass AUDS galajdes are located well beyond the local 
volume and may represent more typical volumes in the Uni¬ 
verse. 


3.5 Evolution of the HIMF 


The redshift range of AUDS allows us to split the sample 
into redshift bins to see evolutionary trends. We split our 
sample at the mean redshift of our sample (2 = 0.065). This 
creates a lower redshift bin with 52 galaxies and a mean 
redshift of 2 = 0.036 and higher redshift bin with 50 galaxies 
and a mean redshift of 2 = 0.095. The binning by redshift 
indirectly means we divide the galaxies by their mass, as 
the faint galaxies can only be detected close by while rarer, 
massive galaxies need the large volume of the high redshift 
bin to be found. 

However, there is some overlap ove r a mass range of 10* 
to 10^^ Mq as can be seen in Figure [l4| There appears to be 
evidence for only a modest change in the HIMF. Formally, if 
a and , are held fixed at their best-fit value for the whole 
sample, >1>* is ~ 16% lower for the higher redshift sample. 
Any change in the density of galaxies with H l mass > 10^° 
or < 10* Mq cannot be explored. The full AUDS sample will 
provide tighter limits on the amount of evolution that can 
arise from feedback processes in galaxies over this redshift 
range (Kim et al.|2013|. 


3.6 Comparison of HIMF 


The HIMF is a useful tool to describe how much H l is locked 
up in galaxies. The slope of the HIMF gives the relative 
importance of low mass and high mass galaxies. An early 
HIMF measured from the AHISS survey ( Zwaan et al.|1997 1 
found a flat faint end slope of a = —1.2. The Arecibo Dual- 
Beam Survey (ADBS) | Rosenberg fc Schneider]2002 1 found 
a much steeper slope of a = —1.53. However, both these 
surveys suffer from small numbers and small volumes. 

The two largest blind H l surveys HIPASS and AL¬ 


FALFA (40%) find slopes a = —1.37 ± 0.03 (Zwaan et al. 


20051 and a = —1.33±0.02 (Martin et al. 20101 respectively. 


The good agreement between the two surveys seems to sug¬ 
gest that the slope of the HIMF in the local Universe is well 
defined. However, at the high mass end the ALFALFA sur¬ 
vey reveals a larger number of galaxies than HIPASS (Fig- 


151. Martin et al. (20101 explains this difference with the 


higher upper redshift limit and larger volume in comparison 
to HIPASS. 


The AUDS sample allows us to construct an HIMF for 
the first time which is independent of the local volume and 
at much higher redshifts and to higher sensitivities than pre¬ 
vious surveys. Figure compares the results from AUDS 
with the best-fit of HIPASS and ALFALFA. AUDS meas¬ 
ures a slightly steeper slope a than HIPASS or ALFALFA, 
but overall the surveys agree well with each other. We find 
a tentative rise at very low masses caused by faint, low- 
mass galaxies detected in our survey, which might have been 
missed in previous, less sensitive surveys. Twice as many 
galaxies are detected compared to the prediction from the 
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Figure 14. Top panel: Comparison between the HIMF derived 
from the low redshift (red diamonds) and high redshift (black tri¬ 
angles) sub-sample. The blue dashed line is the fit to the complete 
AUDS sample (Figure 11). In the mass range of 10^ — 10^^h~^M q 
the low redshift HIMF is significantly higher than the high red¬ 
shift HIMF. The discrepant high redshift bin at Mq 

only contains one galaxy. Lower panel: The number distribution 
of the low redshift sample [{z) = 0.036) in red and the high red¬ 
shift sample {{z) = 0.094) in black. 


[h| 



Figure 16. The distribution of the H i mass density as a func¬ 
tion of Mpj I. The H i mass density is derived by multiplying the 
HIMF by the centre of each Mh i mass bin. The red diamonds and 
dashed line show the mass density derived from the Sl/\4nax,CV 
method and the black diamonds and dashed line correspond to 
the 1/Vmax method. The blue line indicates the H i mass dens¬ 
ity from HIPASS. The comparison shows that for AUDS there is 
a slightly higher contribution from both faint and bright galaxies 
to the overall H i density. 



log Mhi [h-^ Mg] 


Figure 15. Comparison between the AUDS HIMF (red diamonds 
and red dashed line) and the best-fit Schechter functions from 
HIPASS (blue solid line) and ALFALFA (green dot dashed line). 
The individual data points of the HIMF agree (within the error- 
bars) with the two surveys in the local Universe, apart from a 
small number of low H i mass AUDS galaxies. 


extrapolated HIMF. Unfortunately the overall small-number 
statistics make it necessary to interpret this result with cau¬ 
tion. Although the detected faint galaxies are well beyond 
the Local Group, the volume sampled at this mass level is 
only about 8 h~^ Mpc®, so the cosmic variance is high. 


4 COSMIC Hi DENSITY Hhi 


4.1 Hi Mass Density(pHi) and Cosmic Hi 
Density (flm) 

Figureshows the Hi mass density pm for different Hi 
masses; we compare the results before and after cosmic vari¬ 
ance correction. The lines indicate the best-fit to pH i using 
equation The measured slope of the HIMF has import¬ 
ant implication for the contribution to the H l mass density 
Phi of low-mass galaxies. The measured slope of the AUDS 
sample that the gas mass density is dominated by galax¬ 
ies with masses around 10^'^h~^M q corresponding to the 
knee of the HIMF. Comparing the results of AUDS and 
HIPASS (Figure [I^ shows differences between the two sur¬ 
veys. AUDS detects more galaxies at the low mass end and 
also detect slightly more galaxies at the very high mass end. 

The total H l mass density can then be computed by 
integrating the best Schechter fit using pui = r(a -|- 2) x 
<I>*Mh,, where F is the Euler Gamma function and a, 4>* 
and Mh i the fit parameters from the Schechter fit. This gives 
Phi = (6.24 ± 0.23) X IO^/iMqMpc”® for the y 1/Vmax 
method and pui = (6.53 ± 0.31) x IO^/iMqMpc”® for 
y 1/Umax, cv- We estimated the error using the jackknifing 


technique as with the HIMF calculation (Section 3.21. 


In addition to this, we can also calculate pui by sum¬ 
ming over individual data points in the H l density distribu¬ 
tion. The results are pui = (4.46 ± 0.21) x IO^/iMqMpc”® 
and phi = (5.78 ± 0.20) x 10^ h MqMpc~^ for the uncor¬ 
rected and corrected values respectively. The results of the 
summation and integration of pn i agree within the la error- 
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Redshift 2 


Figure 17. Measurements of Hh i for different redshift bins in 
AUDS (black points). Note that not all data points are inde¬ 
pendent from each other but overlap in redshift (see Table 5). 
The mean redshift of the subsamples are z = 0.009 (triangle), 
z = 0.036 (square), z = 0.065 (diamond), z = 0.095 (circle) and 
z = 0.127 (hexagon). Comparing the results to HIPASS (blue 
right pointing triangle) and the AUDS precursor (red left pointing 
triangle) shows little evolution in the measurements from z = 0 
to z = 0.16. 


bars indicating that our survey was able to adequately probe 
below the knee of the HIMF to capture most of flni- 

To compare our results to other measurements we com¬ 
pare the co-moving H l density, pn i to the current {z = 0) 
critical density of the Universe, Pcrit.o, to derive the cosmic 
H I density 


fin I 


Ph I 

Pcrit{z = 0) 


SttG 


( 8 ) 


where G is the Gravitational constant and Hq is the 
Hubble constant at z = 0. Note that the dehnition of 
fin I, consistent with previous work, simply scales the co¬ 
moving density by the current critical density and not 
by the co-moving, redshift dependent critical density. We 
find Hhi = (1.63 ± 0.08) x before correction and 

fin I = (2.33 ± 0.07) x after cosmic variance correc¬ 

tion, summing up the data points. As noted in Section [2.5| 
the cosmic variance correction has formal uncertainties of 
8 % in piggy-backing to the larger 4.2 deg^ SDSS field (11% 
for a single field, 8% for two fields) and 7% for SDSS as 
a whole (Driver & Robotham 2010), giving rise to a com¬ 
bined systematic uncertainty of 11%. However, there are ad¬ 
ditional factors due to differing bias factors for H l and op¬ 
tical surveys, and unknown stochasticity factors which may 
raise this overall uncertainty (see Chang et al. 2010) which 
are neglected in this paper. 


4.2 Evolution of the Cosmic H l Density Hh i 

To trace the evolution of cool gas with cosmic time we split 
our sample in different bins of redshift. For each sub-sample 
we derived the HIMF and calculated Hhi- As the number 


Table 6. Overview of measurements of Hh i up to a redshift of 2 : = 
0.2. We calculate a weighted mean of the individual measurement 
to find a universal value of Hh i for this redshift range. 


Reference ( 2 ) fljr, Method 

_ 


Zwaan et al.| 

20051 


0.015 

2.6 ±0.3 

Sources 

Martin et al. 

12010 


0.025 

3.0 ±0.2 

Sources 

i't’eudline: et i 


(^ 2 o 

11 

1 0.111 

3.74 ± 1.70 

Sources 

Delhaize et al. 

2013 

0.028 

Q qq-|-U.i 3 U 

^•^^-0.59 

Stacking 

Delhaize et 

al. 

2 

013 

0.096 

3 19+0-43 

Stacking 

Hhee et al. 

(20 

13 

■ 


0.1 

2.3 ±0.4 

Stacking 

Hhee et al. 

(2013 

■ 


0.2 

2.4 ± 0.6 

Stacking 


This paper 0.034 3.68 di 0.39 Sources 

This paper 0.095 1.93 ±0.19 Sources 


Best combined estimate 0 — 0.2 2.63 ± 0.10 


of the galaxies in the redshift bins are relatively small we 
decided to fit the HIMF with a Schechter function keep¬ 
ing and a fixed using the results we found for the 

X) l/rmax,cv and only fit the normalisation $*. Hhi is then 
calculated by integrating over the Schechter function. First 
we split the sample in two redshift bins at the mean red¬ 
shift of the sample (( 2 ) = 0.065), creating two samples with 
mean redshifts of ( 2 ) = 0.036 and ( 2 ) = 0.095. We find 
Dhi = (3.68 ± 0.39) X10“'*for the low redshift sample 
and Hhi = (1.93 ± 0.19) x 10“*/i“* for the high redshift 
sample. The results indicate a possible decrease in Hhi to¬ 
wards the upper end of the redshift range of the sample. 

Next we selected the 8 highest redshift galaxies between 
the redshifts of 0.119 and 0.132 to probe the high red¬ 
shift end of our sample. Integrating over the Schechter 
function we find Hhi = (2.24 ± 0.78) x 10“*h“*. Due to 
the small numbers of galaxies in this bin Poisson scatter 
is the dominant source of error. We also selected a low 
redshift with the 8 lowest redshift galaxies. The result is 
flni = (5.33 ± 1.89) x 10“*/i“* agreeing well with results 
from HIPASS and ALFALFA as well as the AUDS lower 
redshift bin. 

The results for Dhi for different redshift bins are sum¬ 
marised in Figure as well as Table Note that the meas¬ 
urements of the 8 high and 8 low-redshift galaxies are not 
independent of the data binned at the mean redshift of the 
sample. It appears likely from this comparison that the low 
redshift points are high compared with those at higher red¬ 
shift. The HIPASS result agrees better with the high red¬ 
shift points suggesting that there is no evolution detected 
and that the low redshift results may be subject to cosmic 
variance errors. 


4.3 Discussion 

Measuring Hh i and its evolution with redshift has long been 
an important scientific question. 21 cm measurements at 
low redshift provide a good constraint for Hhi in the local 
Universe. Beyond that measurements have been more diffi¬ 
cult. Until very recently there has been a huge gap between 
these local measurements and measurements at high redshift 
2 > 1.5 using DLA measurements. Moreover, this period is 
marked by a significant change in star formation rate and 
therefore interesting for galaxy evolution studies. Successful 
attempts have been made using the stacking technique to 
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Table 7. Results for Qh i binning the sample in different redshift 
bins for AUDS. 



(^> 

Number 

log 

^ M© 

+2 log h 

Oh 1 

(lO-^h-i) 

0 - 0.013 

0.009 

8 

6.3 - 9.3 

5.33 ± 1.89“ 

0 - 0.065 

0.036 

52 

6.3 - 9.5 

3.68 ±0.39*’ 

0.065 - 0.132 

0.095 

53 

8.4 - 10.3 

1.93 ±0.19*’ 

0.119 - 0.132 

0.127 

8 

8.9 - 10.3 

2.24 ± 0.78“ 

0 - 0.132 

0.065 

102 

6.3 - 10.3 

2.33 ±0.07*’ 


“ Errors derived using Poisson statistics. 
** Errors derived using jackknifing. 


bridge that intermediate redshift gap (Rhee et al.|2013 Del- 


|haize et al.|2013]) as well as the intensity mapping technique 
( [Chang et al. 2010 Masui et al. 2013 \ at higher redshift 
(Figure 18 1 . 

AUDS is the first survey which begins to probe this red¬ 
shift range using direct detections (Figure 18 1 . The AUDS 
results combined with the low redshift 21 cm surveys im¬ 
ply only limited, if any evolutionary effects out to z = 0 . 2 , 
corresponding to a look-back time of 1.7 h~^ Gyr. 

We use these measurements (Table to calculate a 
weighted average for Hh i for 0 < 2 < 0.2. Weighing each 
measurement by its error, we find flni = (2.63 ±0.10) x 

as the 
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The result is presented in the Figure 
grey shaded region indicating the 1 a level. The black dashed 
line, also presented in Figure jlSj shows the results from semi- 
analytic models presented by Lagos et al. (20141, using their 
model as described in [Lagos et al.| ( |2012 l. They also find a 
very weak increase in Hhi over this redshift range, in agree¬ 
ment with the observational result for Dhi. 


5 CONCLUSIONS 

In this paper we present early results from AUDS, a deep 
blind 21 cm line survey of two selected fields in the red¬ 
shift range between 0 and 0.16 with a sensitivity of 80 /iJy. 
We detected a total of 102 galaxies with masses from 
log(MHi/M 0 ) — 21og/i = 5.6 — 10.3. We used synthetic 
galaxies to derive a completeness function based on 5int, 
W and a. We used the SDSS DR7 to correct the sample for 
cosmic variance and derived a X] V^ax.cv HIMF which 
is well fit by a Schechter function with the parameters: 
a = -1.37 ±0.03, = (7.72 ± 1.4) 10“®li®Mpc“® and 

log(Mj*[,/M) = (9.75 ± 0.041) ± 2 log ft, a result which is 
in good agreement with results from local surveys (HIPASS, 
ALFALFA). 

The co-moving H l mass density at the mean redshift 
of the sample is pn i = (5.78 ± 0.20) x 10^ h MqMpc”^ con¬ 
tributing a fraction Dhi = (2.33 ± 0.07) x 10~'*h“^ of the 
critical density of the Universe. The depth of the survey al¬ 
lows for the first time the derivation of both the shape and 
normalization of the HIMF from a blindly surveyed volume 
outside of the local Universe. In the volume that excludes 
the local Universe, at z > 0.06, AUDS probes the mass 
range from log(MHi/M 0 ) — 21og/i = 8 — 10.3. The derived 
HIMF is indistinguishable from that derived from local sur¬ 
veys. These observations constitute strong evidence that the 
HIMF did not rapidly evolve in the last billion years. 

At redshifts up to 0.005, AUDS probes the HIMF at 
masses as low as log(MH i/Mq) = 5.6 ± 2 log h. We detected 


twice as many galaxies as predicted from the local HIMF 
for log(MH i/Mq) < 7 ± 2 log h. This might be an indication 
that the HIMF rises more steeply than previously thought 
at the very low mass end. If correct, this finding implies that 
the fraction of Mhi contributed by low mass galaxies may 
be more significant than previously appreciated. 
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Figure 18. Evolution of the cosmic density with redshift. The colour corresponds to the type of measurement. Blue: blind 21 cm 
surveys, Magenta: H i stacking, Cyan: Intensity mapping, Green: Measurements from Ly-a absorption spectra, Red: AUDS. We present 
the result using the complete AUDS sample (single thick diamond) as well as the binned results (thin diamonds). Observations show no 
significant evolution in Qh i out to z = 0.2. Calculating the best combined estimate based on all measurements of fin i out to this redshift 
we find the la interval marked in grey. The black dashed line is the prediction presented by [Lagos et al.| | |2014[ |, using semi-analytic 
models described in [Lagos et al. ]< |2012| l. 
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